Taverna/myGrid: Aligning a Workflow System with the Life Sciences Community

نویسندگان

  • Tom Oinn
  • Peter Li
  • Douglas B. Kell
  • Carole Goble
  • Antoon Goderis
  • Mark Greenwood
  • Duncan Hull
  • Robert Stevens
  • Daniele Turi
  • Jun Zhao
چکیده

Bioinformatics is a discipline that uses computational and mathematical techniques to store, manage, and analyze biological data in order to answer biological questions. Bioinformatics has over 850 databases [154] and numerous tools that work over those databases and local data to produce even more data themselves. In order to perform an analysis, a bioinformatician uses one or more of these resources to gather, filter, and transform data to answer a question. Thus, bioinformatics is an in silico science. The traditional bioinformatics technique of cutting and pasting between Web pages can be effective, but it is neither scalable nor does it support scientific best practice, such as record keeping. In addition, as such methods are scaled up, slips and omissions are more likely to occur. A final human factor is the tedium of such repetitive tasks [397]. Doing these tasks programmatically is an obvious solution, especially for the repetitive nature of the tasks. Some bioinformaticians have the programming skills to wrap these distributed resources. Such solutions are, however, not easy to disseminate, adapt, and verify. Moreover, one of the consequences of the autonomy of bioinformatics service providers is massive heterogeneity within those resources. The advent of Web services has brought about a major change in the availability of bioinformatics resources from Web pages and command-line programs to Web services [395], though much of the structural, value-based, and syntactic heterogeneity remains. The consequent lack of a common type system means that services are difficult to join together programmatically, and any technical solution to in silico experiments in biology has to address this issue. Many scientific computing projects within the academic community have turned to workflows as a means of orchestrating complex tasks (in silico experiments) over a distributed set of resources. Examples include DiscoveryNet [373] for molecular biology and environmental data analysis,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Background – myGrid & Taverna

The myGrid project is a large scale investigation into the application of grid and semantic web technologies to the scientific domain, so called eScience. As a pilot project funded by the Engineering and Physical Sciences Research Council (EPSRC) in the UK we have performed research into, and developed prototype software for, various aspects of this convergence. This paper presents some of our ...

متن کامل

Taverna: lessons in creating a workflow environment for the life sciences

Life sciences research is based on individuals, often with diverse skills, assembled into research groups. These groups use their specialist expertise to address scientific problems. The in silico experiments undertaken by these research groups can be represented as workflows involving the co-ordinated use of analysis programs and information repositories that may be globally distributed. With ...

متن کامل

Association of variations in I kappa B-epsilon with Graves’ disease using classical and Grid methodologies

Bioinformatics experiments can be modelled as workflows whereby the order of each computational resource used has been pre-defined. Workflows in the Grid project are composed and enacted using the Taverna workflow system. We have compared the use of Taverna with classical approaches for performing bioinformatics experiments in the genetic analysis of Graves’ disease. Both classical and myGrid m...

متن کامل

Contextualised Workflow Execution in MyGrid

e-Scientists stand to benefit from tools and environments that either hide, or help to manage, the inherent complexity involved in accessing and making concerted use of the diverse resources that might be used as part of an in silico experiment. This paper illustrates the benefits that derive from the provision of integrated access to contextual information that links the phases of a problem-so...

متن کامل

A portal interface to myGrid workflow technology

A rich selection of computational resources are available to scientists working with biological data, and it is common for these scientists to wish to perform composite analyses which use a number of such resources. To support the automation of the performance of these analyses, the Grid project have developed the Taverna workflow workbench [2]. This is a graphical interface which allows a user...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007